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The free energy of globular protein chain is considered to be a functional defined 
on smooth curves in three dimensional Euclidean space. From the requirement of 
geometrical invariance, together with basic facts on conformation of helical proteins 
and dynamical characteristics of the protein chains, we are able to determine, in a 
unique way, the exact form of the free energy functional. Namely, the free energy 
density should be a linear function of the curvature of curves on which the free 
energy functional is defined. This model can be used, for example, in Monte Carlo 
simulations of exhaustive searching the native stable state of the protein chain. 



1. Introduction 

A fascinating and open question challenging physics, biochemistry and even 
geometry is the presence of highly regular motifs such as a — helices and 
(3 — sheets in the folded state of biopolymers and proteins. A wide range 
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of approaches have been proposed to rationalize the existence of such sec- 
ondary structures (see, for example, reviews 1,2 and references therein). 

We propose a pure geometrical approach 3,4 ' 5 to describe the free en- 
ergy of proteins, proceeding from the most general invariance requirements 
and basic experimental facts concerning the protein conformation. Taking 
into account the one-dimensional nature of the protein chains, the relevant 
macroscopic free energy F should be considered as a functional defined on 
smooth curves x(s) (or paths) in the three dimensional Euclidean space 



where s is the length of a protein molecule. The reparametrization invari- 
ance of the functional F demands the free energy density / to be a scalar 
function depending on the geometrical invariants of the position vector x(s), 
which describes the spatial shape of the protein chain. In three dimensional 
ambient space a smooth curve has two local invariants: curvature k(s) and 
torsion k(s). In the general case of D dimensional Euclidean embedding 
space there are D — 1 principal curvatures k a (s),a — 1,2...., D — 1 of a 
curve, where fci(s) = k(s) and k2(s) — k(s). 

The first principal curvature, or simply the curvature, k\(s) = k(s) of 
a curve characterizes the local bending of the curve at the point s. Hence, 
the dependence of free energy density / on k(s) specifies the resistance 
of a protein chain to be bent. The second curvature or torsion k(s) is 
determined by the relative rotation, around the tangent dx(s)/ds at the 
point s, of two neighbor infinitely short elements of the protein chain. It 
is well known 1,2 that, in the case of protein molecules, such a rotation is 
quite easy, as it requires little effort. In other words, this rotation results in 
small energy differences, allowing many overall conformations of a protein 
chain to arise. Thus the dependence of the free energy density / on torsion 
k(s) can be neglected at least as a first approximation. Finally one can 
consider the free energy density / to be a function only of the curvature k(s) , 
i.e., / = f(k(s)). In what follows we shall try to specify this dependence 
explicitly keeping in mind the description of globular protein conformation. 

A peculiarity of conformation of globular proteins is that they can be 
ordered assemblies either of helices or of sheets as well as a mixture of he- 
lices and sheets 1,2 . In the phenomenological macroscopic approach, which 
is developed here, the presence of sheets in the spatial structure of globular 
proteins implies the necessity to introduce, in addition to space curves x(s), 
new dynamical variables y(s, s') describing surfaces in ambient space. Ob- 
viously such an extension of the problem setting would complicate consid- 
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erably our consideration. Therefore we confine ourselves to helical proteins 
and try to answer the question: Is it possible to specify the function f(k(s)) 
in such a way that the extremals of the functional F = J f ds would be only 
helices? The answer to this question turns out to be positive and unique, 
namely, the density of the free energy f(k(s)) should be a linear function 
of the curvature k(s). We sketch here the proof of this assertion (for details 
see Ref. 6 > 7 < 8 ). 



2. Euler-Lagrange equations in terms of principal 
curvatures and their exact integrability 

For an arbitrary functional F defined on curves x l (s) in D-dimensional 
space the Euler-Lagrange equations are a set of exactly D equations 

g=0, i = l,2,. ..,£>. (2) 

However, if the functional F depends only on the curvature 

F = f f(k(s))da, (3) 

then D equations (2.3) for D variables x l (s), i — 1, 2, ... ,D give D — 1 
equations for the principal curvatures k a (s), a = 1,2, ...,£) — 1 

-J (f'ih)) = -{kl- kl) f'(h) + h f(h) , (4) 

2^(/'(fc 1 )fc 2 ) = k'if'ih), (5) 

*&(*) = M«) = ••• = k D -i(s) =0. (6) 

Thus, in the problem under consideration there are two non trivial equa- 
tions (4) and (5) for the curvatures fci(s) and fc 2 (s). Equation (5) can be 
integrated with arbitrary free energy density /(fci) 

(f'(k 1 )) 2 k 2 =C, (7) 

where C is an integration constant. 

Relation (7) enables one to eliminate the torsion &2(s) from Eq. (4). As 
a result we are left with one nonlinear differential equation of the second 
order for the curvature k\{s) 



d s^'^ + ^-WQW) f,{kl) - klf{kl) = °- (8) 

Having resolved this equation for k\(s), one can determine the rest of 
curvatures by making use of Eqs. (7) and (6). Integration of the respective 
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Frcnct equations with principal curvatures found enables one to recover the 
curve x(s) itself. 

Notwithstanding its nonlinear character, Eq. (8) can be integrated in 
quadratures for arbitrary function /(fci). To show this, the first integral for 
this equation can be constructed proceeding from the symmetry properties 
of the variational problem under study 6 

M 2 = (f'ih) h - /(fcr)) 2 + + (k[f(f" (fci)) 2 . (9) 

By direct differentiation of Eq. (9) with respect to s one can be convinced 
that, for 



/"(fci)^O, 



(10) 



the relation (9) is an integral of the nonlinear differential equation (8), 
which determines the curvature of a stationary curve. From (9) we deduce 



where 



9(h) = 



(/"(fci)) 2 



M 2 - 



C 2 



cm)) 2 



-(fci /'(**)- /(**))' 



Integration of Eq. (11) gives 



rki(s) 

J fein 1 



dk 



= ± (s - s ) 



(ll) 



(12) 



(13) 



with fcio — fci(so)- 

Thus, if the free energy density /(fci) obeys the condition (10), then the 
curvature fci(s) and the torsion fc 2 (s) of the stationary curve x(s) are the 
functions of the parameter s defined by Eqs. (13) and (7). The case when 
the condition (10) is not satisfied, i.e., when the free energy density /(fci) 
is a linear function of the curvature ki(s), will be considered below. 

Now we are going to fix the function /(fci), requiring that all the solu- 
tions to the Euler - Lagrange equations are helices. From the differential 
geometry of curves it is known that the helices in three dimensional space 
have a constant curvature (ki) and a constant torsion (fc 2 ) which determine 
the radius R and the step d of a helix 



R 



k\ 



k\ 



d- 



27r|fc 2 
k\ 



k\ 



(14) 
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Under the condition (10) the curvature ki and the torsion fc 2 of the sta- 
tionary curve are not constant but they are the functions of the parameter 
s which are defined in Eqs. (13) and (7). Hence, for the free energy density 
/(fci) we are looking for, we have to consider the case, when 

f"(h) = 0, (15) 

i.e., f(ki) is a linear function of the curvature fci(s) 

f(k 1 ) = a + /3k 1 (s), (16) 

where a and (3 are constants. Indeed, substituting Eq. (16) into (7) and 

(8) we obtain the constant curvatures k\ and fc 2 : 

fc = "^. k > = ^- W 

Since k\(s) = |x"(s)| > 0, the constants a and j3 should have opposite 
signs. It is natural to put a > and (3 < 0. 

For the free energy density /(fci), linear in curvature, the integral 

(9) gives just the relation between the integration constants M 2 and C 2 : 
M 2 [l-(C/pM) 2 } = a 2 . 

When a = 0, Eq. (8) implies that the curvature ki(s) is an arbitrary 
function of s and the integration constant C vanishes. In this case Eq. 
(7) yields fc 2 = 0. Hence, for a = 0, the solutions to the Euler-Lagrange 
equations are arbitrary plane curves, which is evidently unacceptable for 
our purpose. 

Finally, requiring that the stationary curves for the functional (3) are 
only helices, we uniquely determine the free energy density, namely it should 
be a linear function of the curvature 

f(k 1 ) = a-\(3\k 1 (s) (18) 

with nonzero constants a and /?, providing a > and (3 < 0. From the 
mathematical stand point, we have resolved in fact the inversion variational 
problem, i.e., proceeding from the specified solutions to the variational 
Euler-Lagrange equations (helical curves) we have uniquely recovered the 
respective functional. 

Closing this section we would like to make the following note. It is well 
known, that in the protein physics the chirality property of these molecules 
is important. At the first glance we have ignored this point because from 
the very beginning we have eliminated the dependence of the free energy 
density / on the curve torsion. But the fact is the integration constant C in 
Eq. (7) is responsible for the chirality property of the helical curves under 
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consideration. Really, this constant is an analog of the Pauli-Lubanski 
pseudoscalar 6, 12,13 (in the three dimensional space-time the Pauli-Lubanski 
vector reduces to the pseudoscalar). The positive and negative values of 
this constant distinguish the left-hand and right-hand chirality of the helical 
curves. 

3. Conclusion 

Proceeding from rather general principles and making use of the basic facts 
concerning the conformation of globular proteins we have obtained, in a 
unique way, a geometrical model for phenomenological description of the 
free energy of helical proteins. It is worth noting that our functional (18) 
should be considered as an effective free energy of the helical protein which 
already takes into account the n atomic interactions within the protein and 
with the solvent. Hence, there is no need to quantize it, as one proceeds in 
the random walk studies. 9,10,11 

Certainly our simple model does not pretend to describe all the aspects 
of the protein physics. However, one can hope that it could be employed, 
for example, in Monte Carlo simulation to search for a stable native state 
of the protein. In this case the model can be used for the description of 
the free energy of individual parts (blocks) of a protein chain that have 
the helical form. Without any doubt, it should result in simplification and 
acceleration of the exhausting searching of the native stable state of the 
protein chain by a computer. 1 ' 2 
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